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ASSAYS FOR THE IDENTIFICATION OF COMPOUNDS WHICH INHIBIT 
ACTIVATION OF CAMP AND MITOGEN RESPONSIVE GENES 
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5 rights in this invention. 

FIELD OF THE INVENTION 

The present invention relates to analytical 
methods. In a particular aspect, the present invention 
relates to methods for the identification of compounds 

10 which mediate the interaction between signal dependent 
transcription factors and co-factor protein (s) involved in 
the activation of transcription. In another aspect, the 
present invention relates to methods for the identification 
of new signal dependent transcription factors. In yet 

15 another aspect, the present invention relates to methods 
for the identification of novel co-factor protein(s) which 
mediate the interaction between signal dependent 
transcription factors and inducer molecules involved in the 
activation of transcription. 

20 BACKGROUND OF THE INVENTION 

Many eukaryotic genes are regulated in an 
inducible, cell type-specific fashion. Genes expressed in 
response to heat shock, steroid/thyroid hormones, phorbol 
esters, cyclic adenosine monophosphate (cAMP) , growth 

25 factors and heavy metal ions are examples of this class. 
The activity of cells is controlled by external signals 
that stimulate or inhibit intracellular events. The 
process by which an external signal is transmitted into and 
within a cell to elicit an intracellular response is 

30 referred to as signal transduction. Signal transduction is 
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generally initiated by the interaction of extracellular 
factors (or inducer molecules, i.e., growth factors, 
hormones, adhesion molecules, neurotransmitters, and other 
mitogens) with receptors at the cell surface. 
5 Extracellular signals are transduced to the inner face of 
the cell membrane, where the cytoplasmic domains of 
receptor molecules contact intracellular targets. The 
initial receptor-target interactions stimulate a cascade of 
additional molecular interactions involving multiple 
10 intracellular pathways that disseminate the signal 
throughout the cell. 

Many of the proteins involved in signal 
transduction contain multiple domains. Some of these 
domains have enzymatic activity and some of these domains 
15 are capable of binding to other cellular proteins, DNA 
regulatory elements, calcium, nucleotides, lipid mediators, 
and the like. 

Protein-protein interactions are involved in all 
stages of the intracellular signal transduction process - 

2 0 at the cell membrane, where the signal is initiated in the 
cytoplasm by receptor recruitment of other cellular 
proteins, in the cytoplasm where the signals are 
disseminated to different cellular locations, and in the 
nucleus where proteins involved in transcriptional control 

2 5 congregate to turn on or turn off gene expression. 

Mitogenic signaling affects the transcriptional 
activation of specific sets of genes and the inactivation 
of others. The nuclear effectors of gene activation are 
transcription factors that bind to DNA as homomeric or 
30 heteromeric dimers. Phosphorylation also modulates the 
function of transcription factors, as well. Oncogenes, 
first identified as the acute transforming genes transduced 
by retroviruses, are a group of dominantly acting genes. 
Such genes, which are involved in cell division, encode 
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growth factors and their receptors , as well as second 
messengers and mitogenic nuclear proteins activated by 
growth f actors . 

The binding of growth factors to their respective 
5 receptors activates a cascade of intracellular pathways 
that regulate phospholipid metabolism, arachidonate 
metabolism, protein phosphorylation, calcium mobilization 
and transport, and transcriptional regulation. Specific 
phosphorylation events mediated by protein kinases and 

10 phosphatases modulate the activity of a variety of 
transcription factors within the cell. These signaling 
events can induce changes in cell shape, mobility, and 
adhesiveness, or stimulate DNA synthesis. Aberrations in 
these signal-induced events are associated with a variety 

15 of hyperprolif erative diseases ranging from cancer to 
psoriasis. 

The ability to repress intracellular signal- 
induced response pathways is an important mechanism in 
negative control of gene expression. Selective disruption 

20 of such pathways would allow the development of therapeutic 
agents capable of treating a variety of disease states 
related to improper activation and/ or expression of 
specific transcription factors. The present invention 
satisfies this need and provides related advantages as 

25 well. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, it has 
been discovered that CREB binding protein (CBP) cooperates 
with upstream activators involved in the activation of 
3 0 transcription by signal dependent transcription factors, 
such as c-Jun (responsive to phorbol ester) , serum response 
factor, and the like. Accordingly, assays employing CBP 
have been developed for the identification of compounds 
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which disrupt the ability of signal dependent transcription 
factors to activate transcription* In another aspect, 
assays employing CBP have been developed for the 
identification of new signal dependent transcription 
factors. In yet another aspect of the present invention, 
assays employing CBP have been developed for the 
identification of novel co-factor protein (s) which mediate 
the interaction between signal dependent transcription 
factors and inducer molecules involved in the activation of 
transcription. In still another aspect, an assay is 

provided to identify compounds which have the binding 
and/or activation properties characteristic of CREB binding 
protein. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a bar graph summarizing the 
injections described in Example 2. Each bar represents the 
percentage of positive cells expressing £-galactosidase 
from 2-3 experiments where 100-200 cells were injected in 
each experiment. [anti-CBP] denotes concentration of 
affinity purified CBP antiserum injected into cells. Right 
(hatched bars) indicate the percent lacZ positive cells 
after microinjection of CRE-lacZ reporter with CBP 
antiserum (anti-CBP) or control IgG (RblgG) . Preincubation 
of antisera with CBP peptide or non-specific ILS peptide 
(lmg/ml) was carried out as indicated. 

Figure 2 is a bar graph summarizing the results 
of CBP antisera injections, as described in Example 3. 
Bars represent the percentage of lacZ positive (blue) cells 
(mean + standard deviation) from 3-5 experiments where 100- 
200 cells were injected in each experiment. Injected cells 
were identified by immunofluorescence and/or lacZ staining. 
Reporter plasmid encoding the lacZ reporter was 
microinjected into NIH3T3 cells. CRE- , SRE-, TRE-lacZ 
reporter activities were determined after microinjected 
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cells were treated as described herein. CMV-, RSV-,- and 
SV4 0-lac2 reporter activities were measured in the absence 
of inducers. Hatched bars indicate % blue cells after 
microinjection with CBP antiserum. Solid bars indicate % 
5 blue cells following injection with control rabbit IgG 
(RblgG) . 

DETAILED DESCRIPTION OF THE INVENTION 

Cyclic AMP (cAMP) regulates the transcription of 
numerous genes through protein kinase-A (PK-A) mediated 

10 phosphorylation, at Serl33, of transcription factor CREB. 
Within the CREB protein, a 60 amino acid Kinase Inducible 
Domain (KID) mediates transcriptional induction by PK-A. 
Based on recent work describing a nuclear CREB Binding 
Protein (CBP) , which specifically interacts with the 

15 phosphorylated KID domain of CREB, it has been examined 
whether CBP is necessary for cAMP regulated transcription. 
Antisera against CBP have been found to completely inhibit 
transcription from a cAMP responsive promoter, but not from 
constitutively active promoters. Surprisingly, CBP has 

20 also been found to cooperate with upstream activators 
involved in phorbol ester and serum responsive 
transcription. It is demonstrated herein that recruitment 
of CBP to certain inducible promoters is intimately 
involved in transmitting inductive signals from 

2 5 phosphorylated, and thus activated, upstream factors to the 

RNA polymerase II complex. A number of analytical uses for 
CBP and CBP-like compounds based on these observations are 
described herein. 

In accordance with the present invention, there 

3 0 is provided a method for the identification of a compound 

which inhibits activation of cAMP and mitogen responsive 
genes, said method comprising: 
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monitoring expression of reporter in response to 
exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
5 out in the presence of : 

a signal dependent transcription 
factor, 

a polypeptide comprising at least 
amino acid residues 461-661 of 
10 the protein set forth in SEQ ID 

NO: 2, and 

a reporter construct comprising a 
reporter gene under the control 
of said signal dependent 
15 transcription factor. 

As employed herein, the phrase "cAMP and mitogen 
responsive genes" refers to early response genes which are 
activated in response to a diverse array of agents 
including mitogens, such as, growth factors, 

20 differentiation inducers and biomodulators . Examples of 
such agents include insulin-like growth factor (IGF-1) , 
erythropoietin (EPO) , nerve growth factor (NGF) , epidermal 
growth factor (EGF) , platelet-derived growth factor (PDGF) , 
fibroblast growth factor (FGF) , transforming growth factor 

25 £ (TGF6) , interferon, tumor necrosis factor (TNF) , 
inter leukins , granulocyte-macrophage colony-stimulating 
factor (GM-CSF) , G-CSF, prolactin, serotonin, angiotensin, 
bombesin, bradykinin, noradrenalin , putrescine, 

concanavalin A, various oncogenic agents including tumor 

3 0 viruses, UV irradiation, estrogen, progesterone, 
testosterone, and the like. 

Signal dependent transcription factors 
contemplated for use in the practice of the present 
invention include phosphorylation dependent activators such 
35 as Jun, Fos, and other early response genes such as Myc, 
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Myb, erbA, and Rel, serum responsive factor, Elk, as well 
as steroid hormone receptors (e.g., glucocorticoid receptor 
(GR) ) , and the like. 

Polypeptides employed in the invention assay 
5 function as co-factors by binding to the signal dependent 
transcription factor as a necessary component of a 
transcriptionally active complex. Examples of such 
co-factors include CBP (i.e., substantially the entire 
amino acid sequence set forth in SEQ ID NO: 2), a 

10 polypeptide comprising amino acid residues 1-661 as set 
forth in SEQ ID NO: 2, as well as functional fragments 
thereof, e.g., residues 461-661, and homologues thereof, 
such as those identified by the method described herein for 
the identification of compounds which have the binding 

15 and/ or activation properties characteristic of CREB binding 
protein. In accordance with one embodiment of the present 
invention, there are provided purified and isolated 
polypeptides, CBPs, that bind to a specific sequence within 
phosphorylated CREB. 

2 0 As used herein, the term "purified" means that 

the molecule is substantially free of contaminants normally 
associated with a native or natural environment. CREB 
binding protein, or functional fragments thereof, useful in 
the practice of the present invention, can be obtained by 

25 a number of methods, e.g., precipitation, gel filtration, 
ion-exchange, reversed-phase, DNA affinity chromatography, 
and the like. Other well-known methods are described in 
Deutscher et al. , Guide to Protein Purification: Methods 
in Enzymology Vol. 182, (Academic Press, 1990), which is 

30 incorporated herein by reference. 

Alternatively, a purified CBP, or functional 
fragment thereof, useful in the practice of the present 
invention, can also be obtained by well-known recombinant 
methods as described, for example, in Ausubel et al. , 



WO 95/21940 



PCI7US95/01325 



Current Protocols in Molecular Biology (Greene Publishing 
Associates, Inc. and John Wiley & Sons, Inc. 1993), also 
incorporated herein by reference. An example of 

recombinant means to prepare CBP, or functional fragments 
5 thereof, is to express nucleic acid encoding CBP, or 
functional fragment thereof, in a suitable host cell, such 
as a bacterial., yeast or mammalian cell, using methods well 
known in the art, and recovering the expressed protein, 
again using methods well known in the art. 

10 CBPs, and biologically active fragments thereof, 

useful in the practice of the present invention can also be 
produced by chemical synthesis. Synthetic polypeptides can 
be produced using Applied Biosystems, Inc. Model 43 OA or 
4 31A automatic polypeptide synthesizer and chemistry 

15 provided by the manufacturer* CBP, and biologically active 
fragments thereof , can also be isolated directly from cells 
which have been transformed with the expression vectors 
described below in more detail. 

The present invention also encompasses nucleic 

20 acids encoding CBP and functional fragments thereof. See, 
for example, SEQ ID NO:l. This invention also encompasses 
nucleic acids which encode substantially the entire amino 
acid sequence set forth in SEQ ID NO: 2 (for example, the 
nucleic acid sequence set forth in SEQ ID NO:l, as well as 

25 nucleic acid sequences which differ from that set forth in 
SEQ ID NO:l due to the degeneracy of the genetic code), 
nucleic acids which encode amino acid residues 1-661, as 
set forth in SEQ ID NO: 2, nucleic acids which encode amino 
acid residues 461-661, as set forth in SEQ ID NO: 2, as well 

3 0 as nucleic acids which encode substantially the same amino 
acid sequences as any of those referred to above, but which 
differ only by the presence of conservative amino acid 
changes that do not alter the binding and/or activation 
properties of the CBP or CBP-like polypeptide encoded 

3 5 thereby. 
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The invention further provides the above- 
described nucleic acids operatively linked to a promoter, 
as well as other regulatory sequences. As used herein, the 
term "operatively linked" means positioned in such a manner 
5 that the promoter will direct the transcription of RNA from 
the nucleic acid. Examples of such promoters are SP6, T4 
and T7. 

Vectors which contain both a promoter and a 
cloning site into which a piece of DNA can be inserted so 

10 as to be operatively linked to the promoter are well known 
in the art. Preferably, these vectors are capable of 
transcribing RNA in vitro or in vivo. Examples of such 
vectors are the pGEM series (Promega Biotech, Madison, WI) . 
This invention also provides a vector comprising a nucleic 

15 acid molecule such as DNA, cDNA or RNA encoding a CBP 
polypeptide. Examples of additional vectors useful herein 
are viruses, such as bacteriophages, baculoviruses and 
retroviruses, cosmids, plasmids, and the like. Nucleic 
acids are inserted into vector genomes by methods well 

20 known in the art. For example, insert and vector DNA can 
both be exposed to a restriction enzyme to create 
complementary ends on both molecules that base pair with 
each other and which are then joined together with a 
ligase. Alternatively, synthetic nucleic acid linkers that 

25 correspond to a restriction site in the vector DNA can be 
ligated to the insert DNA which is then digested with a 
restriction enzyme that recognizes a particular nucleotide 
sequence. Additionally, an oligonucleotide containing a 
termination codon and an appropriate restriction site can 

30 be ligated for insertion into a vector containing, for 
example, some or all of the following: a selectable marker 
gene, such as neomycin gene for selection of stable or 
transient transf ectants in mammalian cells; 
enhancer /promoter sequences from the immediate early gene 

35 of human CMV for high levels of transcription; 
transcription termination and RNA processing signals from 
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SV40 for mRNA stability; SV4 0 polyoma origins of 
replication and ColEl for proper episomal replication; 
versatile multiple cloning sites; and T7 and SP6 RNA 
promoters for in vitro transcription of sense and antisense 
5 RNA. Other means are available and can readily be accessed 
by those of skill in the art. 

Also provided are expression vectors comprising 
DNA encoding a mammalian CBP, or functional fragment 
thereof, adapted for expression in a bacterial cell, a 

10 yeast cell, a mammalian cell or other animal cell. Such 
vectors comprise the regulatory elements necessary for 
expression of the DNA in the bacterial, yeast, mammalian or 
animal cells. Regulatory elements are positioned relative 
to the DNA encoding the CBP polypeptide so as to permit 

15 expression thereof. Regulatory elements required for 
expression include promoter sequences to bind RNA 
polymerase and transcription initiation sequences for 
ribosome binding. For example, a bacterial expression 
vector includes a promoter such as the lac promoter and the 

2 0 Shine-Dalgarno sequence and the start codon AUG (Ausubel et 

al., supra 1993) for transcription initiation. Similarly 
a eukaryotic expression vector includes a heterologous or 
homologous promoter for RNA polymerase II, a downstream 
polyadenylation signal, the start codon AUG, and a 
25 termination codon for detachment of the ribosome. Such 
vectors can readily be obtained commercially or assembled 
by methods well known in the art, for example, the methods 
described above for constructing vectors in general. 
Expression vectors are useful to produce cells that express 

3 0 CBP or functional fragments thereof. 

As employed herein, the term "reporter construct" 
refers to a recombinant construct, for example, an 
expression vector comprising a reporter gene under the 
control of a signal dependent transcription factor. A 
35 signal which induces activation or inactivation of a target 
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gene induces the reporter gene to express an exogenous 
identifiable "signal". Expression of the reporter gene 
indicates that the target gene has been modulated. 
Exemplary reporter genes encode lucif erase, 
5 /?-galactosidase, chloramphenicol transferase, and the like. 
Exemplary reporter constructs useful in the practice of the 
present invention include CRE-lacZ, SRE-lacZ, TRE-lacZ, and 
the like. 

In practicing the assays of the present 
10 invention, reporter plasmid is introduced into suitable 
host cells, along with CBP or a CBP-like polypeptide (or a 
DNA construct encoding same) and signal dependent 
transcription factor. The transfected host cells are then 
cultured in the presence and absence (as a control) of 
15 test compound suspected of being capable of inhibiting 
activation of cAMP and mitogen responsive genes. Next the 
transfected and cultured host cells are monitored for 
induction (i.e., the presence) of the product of the 
reporter gene. 

20 Any cell line can be used as a suitable "host" 

for the invention assays. Presently preferred host cells 
for use in invention assays are HeLa and NIH3T3 cells. 

In accordance with the present invention, 
expression of the reporter gene can be monitored in a 

25 variety of ways. Immunological procedures useful for In 
vitro detection of a polypeptide in a sample include 
immunoassays that employ a detectable antibody. Such 
immunoassays include, for example, ELISA, Pandex 
microf luorimetric assay, agglutination assays, flow 

3 0 cytometry, serum diagnostic assays and immunohistochemical 
staining procedures which are well known in the art. An 
antibody can be made detectable by various means well known 
in the art. For example, a detectable marker can be 
directly or indirectly attached to the antibody. Useful 
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markers include , for example, radionuclides, enzymes, 
fluorogens, chromogens and chemiluminescent labels. 



Compounds which are capable of inhibiting 
activation of cAMP and mitogen responsive genes, and hence 
5 can be identified by the invention assay method, include 
antibodies raised against the binding domain of the protein 
set forth in SEQ ID NO: 2, antibodies raised against the 
binding domain of CBP-like compounds, and the like. 
Presently preferred antibodies are those raised against a 

10 polypeptide fragment comprising amino acid residues from 
about 4 61 up to 661 of the protein set forth in SEQ ID 
NO: 2; with antibodies raised against a polypeptide fragment 
comprising amino acid residues from about 634 up to 648 of 
the protein set forth in SEQ ID NO: 2 (this subfragment is 

15 also set forth specifically as SEQ ID NO: 3) being 
especially preferred. 

Antibodies contemplated for use in the practice 
of the present invention have specific reactivity with the 
above-described CBP or CBP-like compounds. Active antibody 
20 fragments are encompassed within the definition of 
"antibody." As used herein "specific reactivity" refers to 
the ability of an antibody to recognize and bind to an 
epitope on CBP or CBP-like compounds. Antibodies employed 
in the practice of the present invention can be produced by 

2 5 any method known in the art. For example, polyclonal and 

monoclonal antibodies can be produced by methods well known 
in the art, as described, for example, in Harlow and Lane, 
Antibodies: A Laboratory Manual (Cold Spring Harbor 
Laboratory 1988) , which is incorporated herein by 

3 0 reference. The above-described CBP or CBP-like compounds 

can be used as the immunogen in generating such antibodies. 
Altered antibodies, such as chimeric, humanized, CDR- 
grafted or bifunctional antibodies can also be produced by 
methods well known to those skilled in the art. Such 
3 5 antibodies can also be produced by hybridoma, chemical or 
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recombinant methodology described, for example in Ausubel 
et al., supra. The antibodies can be used for determining 
the presence of a CBP-derived polypeptide, for the 
purification of CBP-derived polypeptides, for in vitro 
5 diagnostic methods, and the like. 

In accordance with another embodiment of the 
present invention, there is provided a method for the 
identification of a compound which inhibits activation of 
cAMP and mitogen responsive genes, said method comprising: 

1° (1) contacting a test system with said compound 

under physiological conditions; and 

(2) monitoring expression of reporter in 
response to said compound, relative to 
expression of reporter in the absence of 
15 said compound, wherein said reporter is 

encoded by a reporter construct comprising 
a reporter gene under the control of a 
signal dependent transcription factor, and 

wherein said test system comprises: 
20 said signal dependent transcription 

factor, 

a polypeptide comprising at least 
amino acid residues 461-661 of 
the protein set forth in SEQ ID 
25 NO: 2, and 

said reporter construct. 

In accordance with yet another embodiment of the 
present invention, there is provided a method for the 
identification of a compound which promotes activation of 
30 cAMP and mitogen responsive genes, said method comprising: 
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monitoring expression of reporter in response to 
exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
5 out in the presence of: 

a signal dependent transcription 

factor, or 
a polypeptide comprising at least 
amino acid residues 461-661 of 
10 the protein set forth in SEQ ID 

NO: 2, and 
a reporter construct; 
wherein said reporter construct comprises a 
reporter gene under the control of a signal 
15 dependent transcription factor. 

In accordance with still another embodiment of 
the present invention, there is provided a method for the 
identification of a compound which has the binding and/or 
activation properties characteristic of CREB binding 
2 0 protein, said method comprising: 

monitoring expression of reporter in response to 
exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
25 out in the presence of: 

a signal dependent transcription 

factor, and 
a reporter construct , 
wherein said reporter construct comprises a 
30 reporter gene under the control of a signal 

dependent transcription factor. 



In accordance with a still further embodiment of 
the present invention, there is provided a method for the 
identification of a compound which has the transcription 
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activation properties characteristic of a signal dependent 
transcription factor, said method comprising: 

monitoring expression of reporter in response to 
exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
out in the presence of : 

a polypeptide comprising at least 
amino acid residues 461-661 of 
the protein set forth in SEQ ID 
NO: 2, and 
a reporter construct, 
wherein said reporter construct comprises a 
reporter gene under the control of a signal 
dependent transcription factor. 

The invention will now be described in greater 
detail by reference to the following non-limiting examples. 

EXAMPLE I 
Functional Properties of CBP 

20 To characterize the functional properties of CBP, 

rabbit CBP antiserum was developed against a fragment of 
CBP extending from amino acid residues 634-648 within the 
CREB binding domain of CBP (i.e., KVEGDMYESANSRDE ; SEQ ID 
NO: 3). Crude antiserum was affinity purified on a 

25 synthetic CBP peptide column, as described by Gonzalez et 
al., in Mol. and Cell Biol. 11 ( 3 ): 1306-1312 (1991), which 
is incorporated herein by reference. Far-Western and 
Western blot assays were performed as described by, for 
example, Chrivia et al., in Nature 365:855-859 (1993), also 

3 0 incorporated herein by reference. Thus, Western (CBP) and 
Far-Western ( 32 P-CREB) blot analysis of partially purified 
CBP protein from HeLa nuclear extract was carried out 
following SDS-PAGE and transfer to nitrocellulose. Far- 
Western blots were also obtained for crude HeLa nuclear 
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extracts using 32 P-labeled CREB, phosphorylated with PK-A or 
casein kinase II (CKII) . Far-Western blot analysis was 
also conducted with immunoprecipitates prepared from HeLa 
nuclear extracts with control IgG or affinity purified CBP 
5 antiserum (CBP-Ab) . CREB binding activity was detected 
with 32 P- labeled CREB phosphorylated with PK-A. 

Using the above-described antiserum, a 2 65 kD 
polypeptide was detected on Western blots, as predicted 
from the cDNA (see Chrivia et al., supra), which coincided 
with the predominant phospho-CREB binding activity in HeLa 
nuclear extracts by "Far-Western" blot assay. An identical 
phospho-CREB binding activity was also found in NIH3T3 
cells. This phospho-CREB binding protein appeared to be 
specific for Serl3 3 phosphorylated CREB because no such 
band was detected with CREB labeled to the same specific 
activity at a non-regulatory phospho-acceptor site (Serl56) 
by casein kinase II (CKII) (see Hagiwara et al., Cell 
70:105-113 (1992), which is incorporated herein by 
reference) . 

To further demonstrate that the major phospho- 
CREB binding protein in HeLa and NIH3T3 cells is 
specifically bound by the anti-CBP antibody, 
immunoprecipitates were prepared from crude nuclear 
extracts using the CBP antiserum. Far-Western analysis of 
these immunoprecipitates revealed a 2 65 kD band in samples 
incubated with CBP antiserum, but not with control IgG. 

EXAMPLE II 

Role of Phosphorylation in CREB-CBP Interaction 

To examine whether the phosphorylation dependent 
30 interaction between CREB and CBP was critical for cAMP 
responsive transcription, a microinjection assay was 
employed using CBP antiserum, which would be predicted to 
impair formation of a CREB-CBP complex. Thus, NIH3T3 cells 
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were cultured in 5% C0 2 atmosphere in Dulbecco's Modified 
Eagle's Medium (DMEM) , supplemented with 10% fetal calf 
serum. Forty-eight hours prior to injection, cells were 
passaged into scored glass coverslips and made quiescent by 
5 incubation in medium containing 0.05% fetal calf serum for 
24 hours (see, for example, Hagikara et al., supra and 
Alberts et al r , in Mol . and Cell Biol. 13:2104-2112 (1993), 
both incorporated herein by reference) . Representative 
fields of NIH3T3 cells were injected with pCRE-lacZ 

10 reporter plasmid plus 5, 0.5, and 0.05 mg/ml of affinity 
purified CBP antiserum. Total antibody concentration in 
microinjected cells was maintained at 5 mg/ml by adjusting 
with control Rabbit IgG. Injected cells were stimulated 
with 0.5 mM 8-bromo-cAMP , plus 3-isobutyl-l-methylxanthine 

15 (IBMX) for 4 hours, then fixed and assayed for lacZ 
activity (/?-Gal) as well as antibody content (Texas Red 
anti-Rb) . 

Following microinjection into nuclei of NIH3T3 
cells, a CRE-lacZ- reporter was markedly induced by 
2 0 treatment with 8-bromo-cAMP plus IBMX. Co-injection of CBP 
antiserum with the CRE-lacZ plasmid inhibited cAMP 
dependent activity in a dosage-dependent manner, but 
control IgG had no effect on this response. 

To determine whether CBP antiserum inhibited cAMP 
25 responsive transcription by binding specifically to CBP, 
peptide blocking experiments were performed. Thus, the 
effect of CBP antiserum on CRE-lacZ reporter activity 
following pre-treatment of CBP antiserum with synthetic CBP 
peptide (anti-CBP+CBP) or unrelated peptide (anti-CBP+ILS; 
30 the unrelated peptide, ILS , is described by Leonard et al., 
in Mol. Endocr. 7: 12 75-1283 (1993) , which is incorporated 
herein by reference) was determined. Rabbit IgG+CBP and 
rabbit IgG pre-treated with CBP peptide were used as 
controls. NIH3T3 cells were injected with CRE-lacZ 

35 reporter plus various CBP antisera, stimulated with 0.5 mM 
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8-bromo-cAMP, plus IBMX for 4 hours, and assayed for lacZ 
activity. Cells expressing the lacZ gene product form a 
blue precipitate upon X-gal staining, which quenches 
immunof luorescent detection of the injected antibody. 

5 CBP antiserum, pre-incubated with synthetic CBP 

peptide, was unable to recognize the 265 kD CBP product on 
a Western blot; and could not inhibit CRE-lacZ reporter 
activity upon microinjection into NIH3T3 cells. But 
antiserum treated with an unrelated synthetic peptide (ILS) 
10 retained full activity in both Western and microinjection 
assay, suggesting that the ability of the antiserum to bind 
CBP was critical for its inhibitory effect on cAMP 
dependent transcription. 

Results of these experiments are summarized in 

15 Figure 1. 

EXAMPLE III 
Multip le Signalling Pathways Utilize CBP 

To determine whether CBP activity may be 
restricted to a subset of promoters, several constitutively 
20 active reporter constructs were tested: 
Cytomegalovirus (CMV-lacZ) , 
Rous sarcoma virus (RSV-lacZ) , and 
SV40 (SV40-lacZ) . 
Thus, cells were microinjected with CBP antiserum plus Rous 
25 Sarcoma Virus (pRSV-lacZ) or Cytomegalovirus (pCMV-lacZ) 
reporter constructs . Alternatively , NIH3T3 cells 

microinjected with CBP antiserum (or non-specific rabbit 
IgG (RblgG) ) , plus reporter constructs containing either 
cAMP responsive elements (pCRS-lacZ) , serum responsive 
30 elements (pSRE-lacZ) or phorbol ester responsive elements 
(pTRE-lacZ) . Light field photo-micrographs show cells 
stained for £-galactosidase activity following four hour 
treatment with either 0.5 roM 8-bromo-cAMP, plus IBMX (pCRE- 
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lacZ) , 20% fetal calf serum (pSRE-lacZ) , or 200ng/ml TPA 
(pTRE-lacZ) . Results of £-galactosidase assays are 

summarized in Figure 2. Dark field photos show 

microinjected IgGs as visualized by immunofluorescence 
5 using Texas Red donkey anti-rabbit IgG. 

When examined in NIH3T3 cells by transient 
transfection assay, each of the constitutively active 
reporter constructs had comparable basal activity, relative 
to the cAMP-stimulated CRE reporter plasmid, thereby 

10 permitting the effects of CBP antiserum on these reporters 
to be compared directly. Although co-injected CBP 
antiserum could block cAMP stimulated activity from a CRE- 
lacZ reporter in contemporaneous assays, no inhibition was 
observed on basal expression from any of the constitutive 

15 promoter constructs tested, even when 10-fold lower amounts 
of reporter plasmid were employed. 

These results suggest that CBP can indeed 
discriminate between basal and signal dependent activities 
in vivo. 

2 0 EXAMPLE IV 

CBP-involvement in non-CREB mediated pathways 

Previous reports showing that serum and phorbol 
esters stimulate their target genes through 
phosphorylat ion-dependent trans-activators (see, for 
25 example, Hill et al., in Cell 73:395-406 (1993) or Smeal et 
al., in Nature 354:494-496 (1991), both incorporated herein 
by reference) , suggested that CBP might also function in 
these signaling pathways. Thus, Far-Western analyses were 
carried out with crude HeLa nuclear extracts using 32 P- 

3 0 labeled recombinant Jun protein phosphorylated in vitro 

with either Jun-kinase (JNK; see Hibi et al. , in Genes and 
Develop. 7:2135-2148 (1993), incorporated herein by 
reference) or casein kinase II (CK II) . 



WO 95/21940 



PCT/US95/01325 



20 

Whereas serum and TPA could stimulate reporter 
activity in NIH3T3 cells microin jected with serum 
responsive element (SRE)-lacZ and TPA-responsive element 
(TRE)-lacZ indicator plasmids, respectively, co-injected 
5 CBP antiserum completely blocked both responses. These 
results suggest that CBP not only interacts with CREB, but 
also with other signal-dependent transcription factors. 

In this regard, phorbol esters and serum induce 
TRE-dependent transcription, in part, through the Jun- 

10 kinase (JNK) mediated phosphorylation of c-Jun at Ser63 and 
Ser73 (see , for example, Smeal et al . , supra or Hibi et 
al. , supra). Using 32 P-labeled recombinant c-Jun protein, 
phosphorylated at Ser63 and Ser7 3 with JNK, Far-Western 
blot assays were performed on crude HeLa nuclear extracts. 

15 JNK-phosphorylated c-Jun protein could bind CBP with 
comparable affinity to CREB. But c-Jun labeled to similar 
specific activity at non-activating sites (Thr 231, Ser243, 
and Ser249; see Boyle et al., in Cell 64:573-584 (1991)) by 
CKII, could not interact with CBP, suggesting that 

2 0 interaction between CBP and c-Jun requires phosphorylation 

of the transcriptionally active Ser63 and Ser73 phospho- 
acceptor sites. In view of the inhibitory effect of CBP 
antiserum on TRE-/? gal reporter expression following 
phorbol ester and serum induction, the phosphorylation 
25 dependent interaction between CBP and c-Jun would appear to 
be a critical component of these response pathways. 

EXAMPLE V 
Chromatographic purification of CBP 

Based on the surprising discovery that CBP 

3 0 cooperates with phosphorylation dependent activators by 

recruiting general transcription factors to target 
promoters , it was next examined whether CBP would co- 
fractionate with any general factors in HeLa nuclear 
extracts. Thus, Far-Western analyses of protein fractions 



WO 95/21940 



PCT/US95/01325 



21 

were obtained after phospho-cellulose chromatography. 
Phospho-CREB binding proteins were visualized using 
32 P-labeled CREB phosphorylated in vitro with PK-A 

32 

( P-CREB) . Western analysis was carried out with the same 
5 blot as described above, using affinity purified CBP 
antibody (CBP Ab) . Far-Western ( 32 P-CREB) and Western 
(CBP-Ab) analyses of fractions were also carried out 
following DEAE and DE52 chromatography. Phosphocellulose, 
DEAE, and DE52 chromatography was performed on HeLa nuclear 
10 extracts as described by Ferreri et al., in Proc . Natl. 
Acad. Sci. USA in press (1993), which is incorporated 
herein by reference. 

Both CBP-immunoreactive and phospho-CREB binding 
activities were retained on phosphocellulose columns and 

15 were eluted at 0.3-0.5M KC1. Further purification of a 
comparable phospho-cellulose fraction on DEAE-sepharose and 
DE52 resins showed that CBP was highly enriched in 
fractions containing TFI " (E, F, H) but not TFIID 
activities. Although the general factor which associates 

2 0 directly with CBP is not known, the co-f ractionation of -CBP 
with proteins involved in basal transcription initiation 
suggests a testable mechanism for CBP action. In 
particular, the results presented herein suggest that 
phosphorylation-dependent activators like CREB and Jun 

25 influence assembly of late-acting factors (TFII E, F, H) 
during transcriptional initiation/reinitiation by 
interacting with CBP in a signal dependent manner. 

While the invention has been described in detail 
with reference to certain preferred embodiments thereof, it 
30 will be understood that modifications and variations are 
within the spirit and scope of that which is described and 
claimed. 
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ATG GCC GAG AAC TTG CTG GAC GGA CCG CCC AAC CCC AAA CGA GCC AAA 48 
Met Ala Glu Asn Leu Leu Asp Gly Pro Pro Asn Pro Lys Arg Ala Lys 
15 10 15 
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CTC AGC TCG CCC GGC TTC TCC GCG AAT GAC AAC ACA GAT TTT GGA TCA 96 
Leu Ser Ser Pro Gly Phe Ser Ala Asn Asp Asn Thr Asp Phe Gly Ser 
20 25 30 

TTG TTT GAC TTG GAA AAT GAC CTT CCT GAT GAG CTG ATC CCC AAT GGA 144 
Leu Phe Asp Leu Glu Asn Asp Leu Pro Asp Glu Leu lie Pro Asn Gly 
35 40 45 

GAA TTA AGC CTT TTA AAC AGT GGG AAC CTT GTT CCA GAT GCT GCG TCC 192 
Glu Leu Ser Leu Leu Asn Ser Gly Asn Leu Val Pro Asp Ala Ala Ser 
50 55 60 

AAA CAT AAA CAA CTG TCA GAG CTT CTT AGA GGA GGC AGC GGC TCT AGC 240 
Lys His Lys Gin Leu Ser Glu Leu Leu Arg Gly Gly Ser Gly Ser Ser 
65 70 75 80 

ATC AAC CCA GGG ATA GGC AAT GTG AGT GCC AGC AGC CCT GTG CAA CAG 288 
lie Asn Pro Gly lie Gly Asn Val Ser Ala Ser Ser Pro Val Gin Gin 
85 90 95 

GGC CTT GGT GGC CAG GCT CAG GGG CAG CCG AAC AGT ACA AAC ATG GCC 336 
Gly Leu Gly Gly Gin Ala Gin Gly Gin Pro Asn Ser Thr Asn Met Ala 
100 105 110 

AGC TTA GGT GCC ATG GGC AAG AGC CCT CTG AAC CAA GGA GAC TCA TCA 384 
Ser Leu Gly Ala Met Gly Lys Ser Pro Leu Asn Gin Gly Asp Ser Ser 
115 120 125 

ACA CCC AAC CTG CCC AAA CAG GCA GCC AGC ACC TCT GGG CCC ACT CCC 432 
Thr Pro Asn Leu Pro Lys Gin Ala Ala Ser Thr Ser Gly Pro Thr Pro 
130 135 140 

CCT GCC TCC CAA GCA CTG AAT CCA CAA GCA CAA AAG CAA GTA GGG CTG 480 
Pro Ala Ser Gin Ala Leu Asn Pro Gin Ala Gin Lys Gin Val Gly Leu 
145 150 155 160 

GTG ACC AGT AGT CCT GCC ACA TCA CAG ACT GGA CCT GGG ATC TGC ATG 528 
Val Thr Ser Ser Pro Ala Thr Ser Gin Thr Gly Pro Gly lie Cys Met 
165 170 175 

AAT GCT AAC TTC AAC CAG ACC CAC CCA GGC CTT CTC AAT AGT AAC TCT 576 
Asn Ala Asn Phe Asn Gin Thr His Pro Gly Leu Leu Asn Ser Asn Ser 
180 185 190 

GGC CAT AGC TTA ATG AAT CAG GCT CAA CAA GGG CAA GCT CAA GTC ATG 624 
Gly His Ser Leu Met Asn Gin Ala Gin Gin Gly Gin Ala Gin Val Met 
195 200 205 

AAT GGA TCT CTT GGG GCT GCT GGA AGA GGA AGG GGA GCT GGA ATG CCC 672 
Asn Gly Ser Leu Gly Ala Ala Gly Arg Gly Arg Gly Ala Gly Met Pro 
210 215 220 

TAC CCT GCT CCA GCC ATG CAG GGG GCC ACA AGC AGT GTG CTG GCG GAG 720 
Tyr Pro Ala Pro Ala Met Gin Gly Ala Thr Ser Ser Val Leu Ala Glu 
225 230 235 240 

ACC TTG ACA CAG GTT TCC CCA CAA ATG GCT GGC CAT GCT GGA CTA AAT 768 
Thr Leu Thr Gin Val Ser Pro Gin Met Ala Gly His Ala Gly Leu Asn 
245 250 255 

ACA GCA CAG GCA GGA GGC ATG ACC AAG ATG GGA ATG ACT GGT ACC ACA 816 
Thr Ala Gin Ala Gly Gly Met Thr Lys Met Gly Met Thr Gly Thr Thr 
260 265 270 

AGT CCA TTT GGA CAA CCC TTT AGT CAA ACT GGA GGG CAG CAG ATG GGA 864 
Ser Pro Phe Gly Gin Pro Phe Ser Gin Thr Gly Gly Gin Gin Met Gly 
275 280 285 
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GCC ACT GGA GTG AAC CCC CAG TTA GCC AGC AAA CAG AGC ATG GTC AAT 912 
Ala Thr Gly Val Asn Pro Gin Leu Ala Ser Lys Gin Ser Met Val Asn 
290 295 300 

AGT TTA CCT GCT TTT CCT ACA GAT ATC AAG AAT ACT TCA GTC ACC ACT 960 
Ser Leu Pro Ala Phe Pro Thr Asp lie Lys Asn Thr Ser Val Thr Thr 
305 310 315 320 

GTG CCA AAT ATG TCC CAG TTG CAA ACA TCA GTG GGA ATT GTA CCC ACA 1008 
Val Pro Asn Met Ser Gin Leu Gin Thr Ser Val Gly lie Val Pro Thr 
325 330 335 

CAA GCA ATT GCA ACA GGC CCC ACA GCA GAC CCT GAA AAA CGC AAA CTG 1056 
Gin Ala lie Ala Thr Gly Pro Thr Ala Asp Pro Glu Lys Arg Lys Leu 
340 345 350 

ATA CAG CAG CAG CTG GTT CTA CTG CTT CAT GCC CAC AAA TGT CAG AGA 1104 
lie Gin Gin Gin Leu Val Leu Leu Leu His Ala His Lys Cys Gin Arg 
355 360 365 

CGA GAG CAA GCA AAT GGA GAG GTT CGG GCC TGT TCT CTC CCA CAC TGT 1152 
Arg Glu Gin Ala Asn Gly Glu Val Arg Ala Cys Ser Leu Pro His Cys 
370 375 380 

CGA ACC ATG AAA AAC GTT TTG AAT CAC ATG ACA CAT TGT CAG GCT CCC 1200 
Arg Thr Met Lys Asn Val Leu Asn His Met Thr His Cys Gin Ala Pro 
385 390 395 400 

AAA GCC TGC CAA GTT GCC CAT TGT GCA TCT TCA CGA CAA ATC ATC TCT 12 48 
Lys Ala Cys Gin Val Ala His Cys Ala Ser Ser Arg Gin lie lie Ser 
405 410 415 

CAT TGG AAG AAC TGC ACA CGA CAT GAC TGT CCT GTT TGC CTC CCT TTG 1296 
His Trp Lys Asn Cys Thr Arg His Asp Cys Pro Val Cys Leu Pro Leu 
420 425 430 

AAA AAT GCC AGT GAC AAG CGA AAC CAA CAA ACC ATC CTG GGA TCT CCA 1344 
Lys Asn Ala Ser Asp Lys Arg Asn Gin Gin Thr lie Leu Gly Ser Pro 
435 440 445 

GCT AGT GGA ATT CAA AAC ACA ATT GGT TCT GTT GGT GCA GGG CAA CAG 1392 
Ala Ser Gly lie Gin Asn Thr lie Gly Ser Val Gly Ala Gly Gin Gin 
450 455 460 

AAT GCC ACT TCC TTA AGT AAC CCA AAT CCC ATA GAC CCC AGT TCC ATG 1440 
Asn Ala Thr Ser Leu Ser Asn Pro Asn Pro lie Asp Pro Ser Ser Met 
465 470 475 480 

CAG CGG GCC TAT GCT GCT CTA GGA CTC CCC TAC ATG AAC CAG CCT CAG 1488 
Gin Arg Ala Tyr Ala Ala Leu Gly Leu Pro Tyr Met Asn Gin Pro Gin 
485 490 495 

ACG CAG CTG CAG CCT CAG GTT CCT GGC CAG CAA CCA GCA CAG CCT CCA 1536 
Thr Gin Leu Gin Pro Gin Val Pro Gly Gin Gin Pro Ala Gin Pro Pro 
500 505 510 

GCC CAC CAG CAG ATG AGG ACT CTC AAT GCC CTA GGA AAC AAC CCC ATG 1584 
Ala His Gin Gin Met Arg Thr Leu Asn Ala Leu Gly Asn Asn Pro Met 
515 520 525 

AGT GTC CCA GCA GGA GGA ATA ACA ACA GAT CAA CAG CCA CCA AAC TTG 1632 
Ser Val Pro Ala Gly Gly lie Thr Thr Asp Gin Gin Pro Pro Asn Leu 
530 535 540 

ATT TCA GAA TCA GCT CTT CCA ACT TCC TTG GGG GCT ACC AAT CCA CTG 1680 
lie Ser Glu Ser Ala Leu Pro Thr Ser Leu Gly Ala Thr Asn Pro Leu 
545 550 555 560 
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ATG AAT GAT GGT TCA AAC TCT GGT AAC ATT GGA AGC CTC AGC ACG ATA 1728 
Met Asn Asp Gly Ser Asn Ser Gly Asn lie Gly Ser Leu Ser Thr lie 
565 570 575 

CCT ACA GCA GCG CCT CCT TCC AGC ACT GGT GTT CGA AAA GGC TGG CAT 1776 
Pro Thr Ala Ala Pro Pro Ser Ser Thr Gly Val Arg Lys Gly Trp His 
580 585 590 

GAA CAT GTG ACT CAG GAC CTA CGG AGT CAT CTA GTC CAT AAA CTC GTT 1824 
Glu His Val Thr Gin Asp Leu Arg Ser His Leu Val His Lys Leu Val 
595 600 605 

CAA GCC ATC TTC CCA ACT CCA GAC CCT GCA GCT CTG AAA GAT CGC CGC 1872 
Gin Ala lie Phe Pro Thr Pro Asp Pro Ala Ala Leu Lys Asp Arg Arg 
610 615 620 

ATG GAG AAC CTG GTT GCC TAT GCT AAG AAA GTG GAG GGA GAC ATG TAT 1920 
Met Glu Asn Leu Val Ala Tyr Ala Lys Lys Val Glu Gly Asp Met Tyr 
625 630 635 640 

GAG TCT GCT AAT AGC AGG GAT GAA TAC TAT CAT TTA TTA GCA GAG AAA 1968 
Glu Ser Ala Asn Ser Arg Asp Glu Tyr Tyr His Leu Leu Ala Glu Lys 

645 .. 650 655 

ATC TAT AAA ATA CAA AAA GAA CTA GAA GAA AAG CGG AGG ACA CGT TTA 2016 T 
lie Tyr Lys lie Gin Lys Glu Leu Glu Glu Lys Arg Arg Thr Arg Leu 
660 665 670 

CAT AAG CAA GGC ATC CTG GGT AAC CAG CCA GCT TTA CCA GCT TCT GGG 2064' 
His Lys Gin Gly lie Leu Gly Asn Gin Pro Ala Leu Pro Ala Ser Gly 
675 680 685 

GCT CAG CCC CCT GTG ATT CCA CCA GCC CAG TCT GTA AG A CCT CCA AAT 2112 
Ala Gin Pro Pro Val lie Pro Pro Ala Gin Ser Val Arg Pro Pro Asn 
690 695 700 

GGG CCC CTG CCT TTG CCA GTG AAT CGC ATG CAG GTT TCT CAA GGG ATG 2160 r 
Gly Pro Leu Pro Leu Pro Val Asn Arg Met Gin Val Ser Gin Gly Met 
705 710 715 720 

AAT TCA TTT AAC CCA ATG TCC CTG GGA AAC GTC CAG TTG CCA CAG GCA 2208 
Asn Ser Phe Asn Pro Met Ser Leu Gly Asn Val Gin Leu Pro Gin Ala 
725 730 735 

CCC ATG GGA CCT CGT GCA GCC TCC CCT ATG AAC CAC TCT GTG CAG ATG 225 6 
Pro Met Gly Pro Arg Ala Ala Ser Pro Met Asn His Ser Val Gin Met 
740 745 750 

AAC AGC ATG GCC TCA GTT CCG GGT ATG GCC ATT TCT CCT TCA CGG ATG 2304 
Asn Ser Met Ala Ser Val Pro Gly Met Ala lie Ser Pro Ser Arg Met 
755 760 765 

CCT CAG CCT CCA AAT ATG ATG GGC ACT CAT GCC AAC AAC ATT ATG GCC 2352 
Pro Gin Pro Pro Asn Met Met Gly Thr His Ala Asn Asn lie Met Ala 
770 775 780 

CAG GCA CCT ACT CAG AAC CAG TTT CTG CCA CAG AAC CAG TTT CCA TCA 2 400 
Gin Ala Pro Thr Gin Asn Gin Phe Leu Pro Gin Asn Gin Phe Pro Ser 
785 790 795 800 

TCC AGT GGG GCA ATG AGT GTG AAC AGT GTG GGC ATG GGG CAA CCA GCA 2448 
Ser Ser Gly Ala Met Ser Val Asn Ser Val Gly Met Gly Gin Pro Ala 
805 810 815 

GCC CAG GCA GGT GTT TCA CAG GGT CAG GAA CCT GGA GCT GCT CTC CCT 249 6 
Ala Gin Ala Gly Val Ser Gin Gly Gin Glu Pro Gly Ala Ala Leu Pro 
820 825 830 
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AAC CCT CTG AAC ATG CTG GCA CCC CAG GCC AGC CAG CTG CCT TGC CCA 2 544 
Asn Pro Leu Asn Met Leu Ala Pro Gin Ala Ser Gin Leu Pro Cys Pro 
835 840 845 

CCA GTG ACA CAG TCA CCA TTG CAC CCG ACT CCA CCT CCT GCT TCC ACA 2592 
Pro Val Thr Gin Ser Pro Leu His Pro Thr Pro Pro Pro Ala Ser Thr 
850 855 860 

GCT GCT GGC ATG CCC TCT CTC CAA CAT CCA ACG GCA CCA GGA ATG ACC 2640 
Ala Ala Gly Met Pro Ser Leu Gin His Pro Thr Ala Pro Gly Met Thr 
865 870 875 880 

CCT CCT CAG CCA GCA GCT CCC ACT CAG CCA TCT ACT CCT GTG TCA TCT 2688 
Pro Pro Gin Pro Ala Ala Pro Thr Gin Pro Ser Thr Pro Val Ser Ser 
885 890 895 

GGG CAG ACT CCT ACC CCA ACT CCT GGC TCA GTG CCC AGC GCT GCC CAA 2736 
Gly Gin Thr Pro Thr Pro Thr Pro Gly Ser Val Pro Ser Ala Ala Gin 
900 905 910 

ACA CAG AGT ACC CCT ACA GTC CAG GCA GCA GCA CAG GCT CAG GTG ACT 2784 
Thr Gin Ser Thr Pro Thr Val Gin Ala Ala Ala Gin Ala Gin Val Thr 

915 920 925 

CCA CAG CCT CAG ACC CCA GTG CAG CCA CCA TCT GTG GCT ACT CCT CAG 2832 
Pro Gin Pro Gin Thr Pro Val Gin Pro Pro Ser Val Ala Thr Pro Gin 
930 935 940 

TCA TCA CAG CAG CAA CCA ACG CCT GTG CAT ACT CAG CCA CCT GGC ACA 2880 
Ser Ser Gin Gin Gin Pro Thr Pro Val His Thr Gin Pro Pro Gly Thr 
945 950 955 960 

CCG CTT TCT CAG GCA GCA GCC AGC ATT GAT AAT AGA GTC CCT ACT CCC 2928 
Pro Leu Ser Gin Ala Ala Ala Ser lie Asp Asn Arg Val Pro Thr Pro 
965 970 975 

TCC ACT GTG ACC AGT GCT GAA ACC AGT TCC CAG CAG CCA GGA CCC GAT 2976 
Ser Thr Val Thr Ser Ala Glu Thr Ser Ser Gin Gin Pro Gly Pro Asp 
980 985 990 

GTG CCC ATG CTG GAA ATG AAG ACA GAG GTG CAG ACA GAT GAT GCT GAG 3024 
Val Pro Met Leu Glu Met Lys Thr Glu Val Gin Thr Asp Asp Ala Glu 
995 1000 1005 

CCT GAA CCT ACT GAA TCC AAG GGG GAA CCT CGG TCT GAG ATG ATG GAA 3072 
Pro Glu Pro Thr Glu Ser Lys Gly Glu Pro Arg Ser Glu Met Met Glu 
1010 1015 1020 

GAG GAT TTA CAA GGT TCT TCC CAA GTA AAA GAA GAG ACA GAT ACG ACA 3120 
Glu Asp Leu Gin Gly Ser Ser Gin Val Lys Glu Glu Thr Asp Thr Thr 
1025 1030 1035 1040 

GAG CAG AAG TCA GAG CCA ATG GAA GTA GAA GAA AAG AAA CCT GAA GTA 3168 
Glu Gin Lys Ser Glu Pro Met Glu Val Glu Glu Lys Lys Pro Glu Val 
1045 1050 1055 

AAA GTG GAA GCT AAA GAG GAA GAA GAG AAC AGT TCG AAC GAC ACA GCC 3216 
Lys Val Glu Ala Lys Glu Glu Glu Glu Asn Ser Ser Asn Asp Thr Ala 
1060 1065 1070 

TCA CAA TCA ACA TCT CCT TCC CAG CCA CGC AAA AAA ATC TTT AAA CCC 32 64 
Ser Gin Ser Thr Ser Pro Ser Gin Pro Arg Lys Lys lie Phe Lys Pro . 
1075 1080 1085 

GAG GAG CTA CGC CAG GCA CTT ATG CCA ACT CTA GAA GCA CTC TAT CGA 3312 
Glu Glu Leu Arg Gin Ala Leu Met Pro Thr Leu Glu Ala Leu Tyr Arg 
1090 1095 1100 
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CAG GAC CCA GAG TCT TTG CCT TTT CGT CAG CCT GTA GAT CCT CAG CTC 
Gin Asp Pro Glu Ser Leu Pro Phe Arg Gin Pro Val Asp Pro Gin Leu 
1105 1110 1115 1120 



3360 



CTA GGA ATC CCA GAT TAT TTT GAT ATA GTG AAG AAT CCT ATG GAC CTT 
Leu Gly lie Pro Asp Tyr Phe Asp lie Val Lys Asn Pro Met Asp Leu 
1125 1130 1135 



3408 



TCT ACC ATC AAA CGA AAG CTG GAC ACA GGG CAA TAT CAA GAA CCC TGG 
Ser Thr lie Lys Arg Lys Leu Asp Thr Gly Gin Tyr Gin Glu Pro Trp 
1140 1145 1150 



3456 



CAG TAT GTG GAT GAT GTC AGG CTT ATG TTC AAC AAT GCG TGG CTA TAT 
Gin Tyr Val Asp Asp Val Arg Leu Met Phe Asn Asn Ala Trp Leu Tyr 
1155 1160 1165 



3504 



AAT CGT AAA ACG TCC CGT GTA TAT AAA TTT TGC AGT AAA CTT GCA GAG 
Asn Arg Lys Thr Ser Arg Val Tyr Lys Phe Cys Ser Lys Leu Ala Glu 
1170 1175 1180 



3552 



GTC TTT GAA CAA GAA ATT GAC CCT GTC ATG CAG TCT CTT GGA TAT TGC 
Val Phe Glu Gin Glu lie Asp Pro Val Met Gin Ser Leu Gly Tyr Cys 
1185 1190 1195 1200 



3600 



TGT GGA CGA AAG TAT GAG TTC TCC CCA CAG ACT TTG TGC TGT TAC GGA 
Cys Gly Arg Lys Tyr Glu Phe Ser Pro Gin Thr Leu Cys Cys Tyr Gly 
1205 1210 1215 



3648 



AAG CAG CTG TGT ACA ATT CCT CGT GAT GCA GCC TAC TAC AGC TAT CAG 
Lys Gin Leu Cys Thr lie Pro Arg Asp Ala Ala Tyr Tyr Ser Tyr Gin 
1220 1225 1230 



3696 



AAT AGG TAT CAT TTC TGT GGG AAG TGT TTC ACA GAG ATC CAG GGC GAG 
Asn Arg Tyr His Phe Cys Gly Lys Cys Phe Thr Glu lie Gin Gly Glu 
1235 1240 1245 



3744 



AAT GTG ACC CTG GGT GAC GAC CCT TCC CAA CCT CAG ACG ACA ATT TCC 
Asn Val Thr Leu Gly Asp Asp Pro Ser Gin Pro Gin Thr Thr lie Ser 
1250 1255 1260 



3792 



AAG GAT CAA TTT GAA AAG AAG AAA AAT GAT ACC TTA GAT CCT GAA CCT 
Lys Asp Gin Phe Glu Lys Lys Lys Asn Asp Thr Leu Asp Pro Glu Pro 
1265 1270 1275 1280 



3840 



TTT GTT GAC TGC AAA GAG TGT GGC CGG AAG ATG CAT CAG ATT TGT GTT 
Phe Val Asp Cys Lys Glu Cys Gly Arg Lys Met His Gin lie Cys Val 
1285 1290 1295 



3888 



CTA CAC TAT GAC ATC ATT TGG CCT TCA GGT TTT GTG TGT GAC AAC TGT 
Leu His Tyr Asp lie lie Trp Pro Ser Gly Phe Val Cys Asp Asn Cys 
1300 1305 1310 



3936 



TTG AAG AAA ACT GGC AGA CCT CGG AAA GAA AAC AAA TTC AGT GCT AAG 
Leu Lys Lys Thr Gly Arg Pro Arg Lys Glu Asn Lys Phe Ser Ala Lys 
1315 1320 1325 



3984 



AGG CTG CAG ACC ACA CGA TTG GGA AAC CAC TTA GAA GAC AGA GTG AAT 
Arg Leu Gin Thr Thr Arg Leu Gly Asn His Leu Glu Asp Arg Val Asn 
1330 1335 1340 



4032 



AAG TTT TTG CGG CGC CAG AAT CAC CCT GAA GCT GGG GAG GTT TTT GTC 
Lys Phe Leu Arg Arg Gin Asn His Pro Glu Ala Gly Glu Val Phe Val 
1345 1350 1355 1360 



4080 



AGA GTG GTG GCC AGC TCA GAC AAG ACT GTG GAG GTC AAG CCG GGA ATG 
Arg Val Val Ala Ser Ser Asp Lys Thr Val Glu Val Lys Pro Gly Met 
1365 1370 1375 



4128 
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AAG TCA AGG TTT GTG GAT TCT GGA GAG ATG TCG GAA TCT TTC CCA TAT 417 6 
Lys Ser Arg Phe Val Asp Ser Gly Glu Met Ser Glu Ser Phe Pro Tyr 
1380 1385 1390 

CGT ACC AAA GCA CTC TTT GCT TTT GAG GAG ATC GAT GGA GTC GAT GTG 4224 
Arg Thr Lys Ala Leu Phe Ala Phe Glu Glu lie Asp Gly Val Asp Val 
1395 1400 1405 

TGC TTT TTT GGG ATG CAT GTG CAA GAT ACG GCT CTG ATT GCC CCC CAC .4272 
Cys Phe Phe Gly Met His Val Gin Asp Thr Ala Leu lie Ala Pro His 
1410 1415 1420 

CAA ATA CAA GGC TGT GTA TAG ATA TCT TAT CTG GAC AGT ATT CAT TTC 4320 
Gin lie Gin Gly Cys Val Tyr lie Ser Tyr Leu Asp Ser He His Phe 
1425 1430 1435 1440 

TTC CGG CCC CGC TGC CTC CGG ACA GCT GTT TAC CAT GAG ATC CTC ATC 4368 
Phe Arg Pro Arg Cys Leu Arg Thr Ala Val Tyr His Glu He Leu He 
1445 1450 1455 

GGA TAT CTC GAG TAT GTG AAG AAA TTG GTG TAT GTG ACA GCA CAT ATT 4416 
Gly Tyr Leu Glu Tyr Val Lys Lys Leu Val Tyr Val Thr Ala His He 
1460 1465 1470 

TGG GCC TGT CCC CCA AGT GAA GGA GAT GAC TAT ATC TTT CAT TGC CAC 4464 
Trp Ala Cys Pro Pro Ser Glu Gly Asp Asp Tyr He Phe His Cys His 
1475 1480 1485 

CCC CCT GAC CAG AAA ATC CCC AAA CCA AAA CGA CTA CAG GAG TGG TAC 4512 
Pro Pro Asp Gin Lys He Pro Lys Pro Lys Arg Leu Gin Glu Trp Tyr 
1490 1495 1500 

AAG AAG ATG CTG GAC AAG GCG TTT GCA GAG AGG ATC ATT AAC GAC TAT 4560 
Lys Lys Met Leu Asp Lys Ala Phe Ala Glu Arg He He Asn Asp Tyr 
1505 1510 1515 1520 

AAG GAC ATC TTC AAA CAA GCG AAC GAA GAC AGG CTC ACG AGT GCC AAG 4608 
Lys Asp He Phe Lys Gin Ala Asn Glu Asp Arg Leu Thr Ser Ala Lys 
1525 1530 1535 

GAG TTG CCC TAT TTT GAA GGA GAT TTC TGG CCT AAT GTG TTG GAA GAA 4656 
Glu Leu Pro Tyr Phe Glu Gly Asp Phe Trp Pro Asn Val Leu Glu Glu 
1540 1545 1550 

AGC ATT AAG GAA CTA GAA CAA GAA GAA GAA GAA AGG AAA AAA GAA GAG 4704 
Ser He Lys Glu Leu Glu Gin Glu Glu Glu Glu Arg Lys Lys Glu Glu 
1555 1560 1565 

AGT ACT GCA GCG AGT GAG ACT CCT GAG GGC AGT CAG GGT GAC AGC AAA 4752 
Ser Thr Ala Ala Ser Glu Thr Pro Glu Gly Ser Gin Gly Asp Ser Lys 
1570 1575 1580 

AAT GCG AAG AAA AAG AAC AAC AAG AAG ACC AAC AAA AAC AAA AGC AGC 4800 
Asn Ala Lys Lys Lys Asn Asn Lys Lys Thr Asn Lys Asn Lys Ser Ser 
1585 1590 1595 1600 

ATT AGC CGC GCC AAC AAG AAG AAG CCC AGC ATG CCC AAT GTT TCC AAC 4848 
He Ser Arg Ala Asn Lys Lys Lys Pro Ser Met Pro Asn Val Ser Asn 
1605 1610 1615 

GAC CTG TCG CAG AAG CTG TAT GCC ACC ATG GAG AAG CAC AAG GAG GTA 4896 
Asp Leu Ser Gin Lys Leu Tyr Ala Thr Met Glu Lys His Lys Glu Val 
1620 1625 1630 

TTC TTT GTG ATT CAT CTG CAT GCT GGG CCT GTT ATC AGC ACT CAG CCC 4944 
Phe Phe Val He His Leu His Ala Gly Pro Val He Ser Thr Gin Pro 
1635 1640 1645 
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CCC ATC GTG GAC CCT GAT OCT CTG CTT AGC TGT GAC CTC ATG GAT GGG 4992 

Pro lie Val Asp Pro Asp Pro Leu Leu Ser Cys Asp Leu Met Asp Gly 
1650 1655 1660 

CGA GAT GCC TTC CTC ACC CTG GCC AGA GAC AAG CAC TGG GAA TTC TCT 5040 
Arg Asp Ala Phe Leu Thr Leu Ala Arg Asp Lys His Trp Glu Phe Ser 
1665 1670 1675 1680 

TCC TTA CGC CGC TCC AAA TGG TCC ACT CTG TGC ATG CTG GTG GAG CTG 5088 
Ser Leu Arg Arg Ser Lys Trp Ser Thr Leu Cys Met Leu Val Glu Leu 
1685 1690 1695 

CAC ACA CAG GGC CAG GAC CGC TTT GTT TAT ACC TGC AAT GAG TGC AAA 5136 
His Thr Gin Gly Gin Asp Arg Phe Val Tyr Thr Cys Asn Glu Cys Lys 
1700 1705 1710 

CAC CAT GTG GAA ACA CGC TGG CAC TGC ACT GTG TGT GAG GAC TAT GAC 5184 
His His Val Glu Thr Arg Trp His Cys Thr Val Cys Glu Asp Tyr Asp 
1715 1720 1725 

CTT TGT ATC AAT TGC TAC AAC ACA AAG AGC CAC ACC CAT AAG ATG GTG 5232 
Leu Cys lie Asn Cys Tyr Asn Thr Lys Ser His Thr His Lys Met Val 
1730 1735 1740 

AAG TGG GGG CTA GGC CTA GAT GAT GAG GGC AGC AGT CAG GGT GAG CCA 5280" 
Lys Trp Gly Leu Gly Leu Asp Asp Glu Gly Ser Ser Gin Gly Glu Pro 
1745 1750 1755 1760 

CAG TCC AAG AGC CCC CAG GAA TCC CGG CGT CTC AGC ATC CAG CGC TGC 5328" 
Gin Ser Lys Ser Pro Gin Glu Ser Arg Arg Leu Ser lie Gin Arg Cys 
1765 1770 1775 

ATC CAG TCC CTG GTG CAT GCC TGC CAG TGT CGC AAT GCC AAC TGC TCA 5376 
lie Gin Ser Leu Val His Ala Cys Gin Cys Arg Asn Ala Asn Cys Ser 
1780 1785 1790 

CTG CCG TCT TGC CAG AAG ATG AAG CGA GTC GTG CAG CAC ACC AAG GGC 5424". 
Leu Pro Ser Cys Gin Lys Met Lys Arg Val Val Gin His Thr Lys Gly ' 
1795 1800 1805 

TGC AAG CGC AAG ACT AAT GGA GGA TGC CCA GTG TGC AAG CAG CTC ATT 5472 
Cys Lys Arg Lys Thr Asn Gly Gly Cys Pro Val Cys Lys Gin Leu lie 
1810 1815 1820 

GCT CTT TGC TGC TAC CAC GCC AAA CAC TGC CAA GAA AAT AAA TGC CCT 5520 
Ala Leu Cys Cys Tyr His Ala Lys His Cys Gin Glu Asn Lys Cys Pro 
1825 1830 1835 1840 

GTG CCC TTC TGC CTC AAC ATC AAA CAT AAC GTC CGC CAG CAG CAG ATC 5568 
Val Pro Phe Cys Leu Asn lie Lys His Asn Val Arg Gin Gin Gin lie 
1845 1850 1855 

CAG CAC TGC CTG CAG CAG GCT CAG CTC ATG CGC CGG CGA ATG GCA ACC 5616 
Gin His Cys Leu Gin Gin Ala Gin Leu Met Arg Arg Arg Met Ala Thr 
1860 1865 1870 

ATG AAC ACC CGC AAT GTG CCT CAG CAG AGT TTG CCT TCT CCT ACC TCA 5 664 
Met Asn Thr Arg Asn Val Pro Gin Gin Ser Leu Pro Ser Pro Thr Ser 
1875 1880 1885 

GCA CCA CCC GGG ACT CCT ACA CAG CAG CCC AGC ACA CCC CAA ACA CCA 5712 
Ala Pro Pro Gly Thr Pro Thr Gin Gin Pro Ser Thr Pro Gin Thr Pro 
1890 1895 1900 

CAG CCC CCA GCC CAG CCT CAG CCT TCA CCT GTT AAC ATG TCA CCA GCA 5760 
Gin Pro Pro Ala Gin Pro Gin Pro Ser Pro Val Asn Met Ser Pro Ala 
1905 1910 1915 1920 
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GGC TTC CCT AAT GTA GCC CGG ACT CAG CCC CCA ACA ATA GTG TCT GCT 5808 
Gly Phe Pro Asn Val Ala Arg Thr Gin Pro Pro Thr lie Val Ser Ala 
1925 1930 1935 

GGG AAG CCT ACC AAC CAG GTG CCA GCT CCC CCA CCC CCT GCC CAG CCC 585 6 
Gly Lys Pro Thr Asn Gin Val Pro Ala Pro Pro Pro Pro Ala Gin Pro 
1940 1945 1950 

CCA CCT GCA GCA GTA GAA GCA GCC CGG CAA ATT GAA CGT GAG GCC CAG 5904 
Pro Pro Ala Ala Val Glu Ala Ala Arg Gin lie Glu Arg Glu Ala Gin 
1955 1960 1965 

CAG CAG CAG CAC CTA TAC CGA GCA AAC ATC AAC AAT GGC ATG CCC CCA 5952 
Gin Gin Gin His Leu Tyr Arg Ala Asn lie Asn Asn Gly Met Pro Pro 
1970 1975 1980 

GGA CGT GAC GGT ATG GGG ACC CCA GGA AGC CAA ATG ACT CCT GTG GGC 6000 
Gly Arg Asp Gly Met Gly Thr Pro Gly Ser Gin Met Thr Pro. Val Gly 
1985 1990 1995 2000 

CTG AAT GTG CCC CGT CCC AAC CAA GTC AGT GGG CCT GTC ATG TCT AGT 6048 
Leu Asn Val Pro Arg Pro Asn Gin Val Ser Gly Pro Val Met Ser Ser 
2005 2010 2015 

ATG CCA CCT GGG CAG TGG CAG CAG GCA CCC ATC CCT CAG CAG CAG CCG 6096 
Met Pro Pro Gly Gin Trp Gin Gin Ala Pro lie Pro Gin Gin Gin Pro 
2020 2025 2030 

ATG CCA GGC ATG CCC AGG CCT GTA ATG TCC ATG CAG GCC CAG GCA GCA 6144 
Met Pro Gly Met Pro Arg Pro Val Met Ser Met Gin Ala Gin Ala Ala 
2035 2040 2045 

GTG GCT GGG CCA CGG ATG CCC AAT GTG CAG CCA AAC AGG AGC ATC TCG 6192 
Val Ala Gly Pro Arg Met Pro Asn Val Gin Pro Asn Arg Ser lie Ser 
2050 2055 2060 

CCA AGT GCC CTG CAA GAC CTG CTA CGG ACC CTA AAG TCA CCC AGC TCT 6240 
Pro Ser Ala Leu Gin Asp Leu Leu Arg Thr Leu Lys Ser Pro Ser Ser 
2065 2070 2075 2080 

CCT CAG CAG CAG CAG CAG GTG CTG AAC ATC CTT AAA TCA AAC CCA CAG 6288 
Pro Gin Gin Gin Gin Gin Val Leu Asn lie Leu Lys Ser Asn Pro Gin 
2085 2090 2095 

CTA ATG GCA GCT TTC ATC AAA CAG CGC ACA GCC AAG TAT GTG GCC AAT 6336 
Leu Met Ala Ala Phe lie Lys Gin Arg Thr Ala Lys Tyr Val Ala Asn 
2100 2105 2110 

CAG CCT GGC ATG CAG CCC CAG CCC GGA CTT CAA TCC CAG CCT GGT ATG 6384 
Gin Pro Gly Met Gin Pro Gin Pro Gly Leu Gin Ser Gin Pro Gly Met 
2115 2120 2125 

CAG CCC CAG CCT GGC ATG CAC CAG CAG CCT AGT TTG CAA AAC CTG AAC 6432 
Gin Pro Gin Pro Gly Met His Gin Gin Pro Ser Leu Gin Asn Leu Asn 
2130 2135 2140 

GCA ATG CAA GCT GGT GTG CCA CGG CCT GGT GTG CCT CCA CCA CAA CCA 6480 
Ala Met Gin Ala Gly Val Pro Arg Pro Gly Val Pro Pro Pro Gin Pro 
2145 2150 2155 2160 

GCA ATG GGA GGC CTG AAT CCC CAG GGA CAA GCT CTG AAC ATC ATG AAC 6528 
Ala Met Gly Gly Leu Asn Pro Gin Gly Gin Ala Leu Asn lie Met Asn 
2165 2170 2175 

CCA GGA CAC AAC CCC AAC ATG ACA AAC ATG AAT CCA CAG TAC CGA GAA 6576 
Pro Gly His Asn Pro Asn Met Thr Asn Met Asn Pro Gin Tyr Arg Glu 
2180 2185 2190 
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ATG 
Met 


GTG 
Val 


AGG AGA 
Arg Arg 
2195 


CAG 
Gin 


CTG 
Leu 


CTA 
Leu 


CAG CAC 
Gin His 
2200 


CAG 
Gin 


CAG 
Gin 


CAG 
Gin 


CAG CAG 
Gin Gin 
2205 


CAA 
Gin 


CAG 
Gin 


6624 


CAG 
Gin 


CAG CAG 
Gin Gin 
2210 


CAG 
Gin 


CAG 
Gin 


CAA 
Gin 


CAA CAA 
Gin Gin 
2215 


AAT 
Asn 


AGT 
Ser 


GCC 
Ala 


AGC TTG 
Ser Leu 
2220 


GCC 
Ala 


GGG 
Gly 


GGC 
Gly 


6672 


ATG GCG 
Met Ala 
2225 


GGA 
Gly 


CAC 
His 


AGC 
Ser 


CAG TTC 
Gin Phe 
2230 


CAG 
Gin 


CAG 
Gin 


CCA 
Pro 


CAA GGA 
Gin Gly 
2235 


CCT 
Pro 


GGA 
Gly 


GGT 
Gly 


TAT 
Tyr 
2240 


6720 


GCC 
Ala 


CCA 
Pro 


GCC 
Ala 


ATG 
Met 


CAG CAG 
Gin Gin 
2245 


CAA 
Gin 


CGC 
Arg 


ATG 
Met 


CAA CAG 
Gin Gin 
2250 


CAC 
His 


CTC 
Leu 


CCC 
Pro 


ATC CAG 
He Gin 
2255 


6768 


GGC 
Gly 


AGC 

Ser 


TCC 
Ser 


ATG GGC 
Met Gly 
2260 


CAG 
Gin 


ATG 
Met 


GCT 
Ala 


GCT CCA 
Ala Pro 
2265 


ATG 
Met 


GGA 
Gly 


CAA 

Gin 


CTT GGC 
Leu Gly 
2270 


CAG 
Gin 


6816 


ATG 
Met 


GGG 
Gly 


CAG CCT 
Gin Pro 
2275 


GGG 
Gly 


CTA 
Leu 


GGG 

Gly 


GCA GAC 
Ala Asp 
2280 


AGC 
Ser 


ACC 
Thr 


CCT 
Pro 


AAT ATC 
Asn lie 
2285 


CAG 
Gin 


CAG 
Gin 


6864 


GCC 
Ala 


CTG CAG 
Leu Gin 
2290 


CAA 
Gin 


CGG 
Arg 


ATT 
lie 


CTG CAG 
Leu Gin 
2295 


CAG 
Gin 


CAG 
Gin 


CAG 
Gin 


ATG AAG 
Met Lys 
2300 


CAA 
Gin 


CAA 
Gin 


ATT 
He 


6912 


GGG TCA 
Gly Ser 
2305 


CCA 

Pro 


GGC 
Gly 


CAG 
Gin 


CCG AAC 
Pro Asn 
2310 


CCC 
Pro 


ATG 
Met 


AGC 
Ser 


CCC CAG 
Pro Gin 
2315 


CAG 
Gin 


CAC 
His 


ATG 
Met 


CTC 
Leu 
2320 


6960 


TCA 
Ser 


GGA 
Gly 


CAG 
Gin 


CCA 
Pro 


CAG GCC 
Gin Ala 
2325 


TCA 
Ser 


CAT 
His 


CTC 
Leu 


CCT GGC 
Pro Gly 
2330 


CAG 
Gin 


CAG 
Gin 


ATC 
He 


GCC ACA 
Ala Thr 
2335 


7008 


TCC 
Ser 


CTT 
Leu 


AGT 

Ser 


AAC CAG 
Asn Gin 
2340 


GTG 
Val 


CGA 
Arg 


TCT 
Ser 


CCA GCC 
Pro Ala 
2345 


CCT 
Pro 


GTG 
Val 


CAG 
Gin 


TCT CCA 
Ser Pro 
2350 


CGG 
Arg 


7056 


CCC 
Pro 


CAA 
Gin 


TCC CAA 
Ser Gin 
2355 


CCT 
Pro 


CCA 
Pro 


CAT 
His 


TCC AGC 
Ser Ser 
2360 


CCG 
Pro 


TCA 
Ser 


CCA 
Pro 


CGG ATA 
Arg lie 
2365 


CAA 
Gin 


CCC 
Pro 


7104 


CAG 
Gin 


CCT TCA 
Pro Ser 
2370 


CCA 
Pro 


CAC 
His 


CAT 
His 


GTT TCA 
Val Ser 
2375 


CCC 
Pro 


CAG 
Gin 


ACT 
Thr 


GGA ACC 
Gly Thr 
2380 


CCT 
Pro 


CAC 

His 


CCT 
Pro 


7152 


GGA CTC 
Gly Leu 
2385 


GCA 
Ala 


GTC 
Val 


ACC 
Thr 


ATG GCC 
Met Ala 
2390 


AGC 
Ser 


TCC 
Ser 


ATG 
Met 


GAT CAG 
Asp Gin 
2395 


GGA 
Gly 


CAC 
His 


CTG 
Leu 


GGG 
Gly 
2400 


7200 


AAC 
Asn 


CCT 
Pro 


GAA 
Glu 


CAG 
Gin 


AGT GCA 
Ser Ala 
2405 


ATG 
Met 


CTC 
Leu 


CCC 
Pro 


CAG CTG 
Gin Leu 
2410 


AAT 
Asn 


ACC 
Thr 


CCC 
Pro 


AAC AGG 
Asn Arg 
2415 


7248 


AGC 
Ser 


GCA 
Ala 


CTG 
Leu 


TCC AGT 
Ser Ser 
2420 


GAA 
Glu 


CTG 
Leu 


TCC 
Ser 


CTG GTT 
Leu Val 
2425 


GGT 
Gly 


GAT 
Asp 


ACC 
Thr 


ACG GGA 
Thr Gly 
2430 


GAC 
Asp 


7296 


ACA 


CTA 


GAA 


AAG 


TTT 


GTG 


GAG 


GGT 


TTG 


TAG 














7326 



Thr Leu Glu Lys Phe Val Glu Gly Leu 
2435 2440 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2441 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Glu Asn Leu Leu Asp Gly Pro Pro Asn Pro Lys Arg Ala Lys 
1 5 10 15 

Leu Ser Ser Pro Gly Phe Ser Ala Asn Asp Asn Thr Asp Phe Gly Ser 
20 25 30 

Leu Phe Asp Leu Glu Asn Asp Leu Pro Asp Glu Leu lie Pro Asn Gly 
35 40 45 

Glu Leu Ser Leu Leu Asn Ser Gly Asn Leu Val Pro Asp Ala Ala Ser 
50 55 60 

Lys His Lys Gin Leu Ser Glu Leu Leu Arg Gly Gly Ser Gly Ser Ser 
65 70 75 80 

lie Asn Pro Gly lie Gly Asn Val Ser Ala Ser Ser Pro Val Gin Gin 
85 90 95 

Gly Leu Gly Gly Gin Ala Gin Gly Gin Pro Asn Ser Thr Asn Met Ala 
100 105 110 

Ser Leu Gly Ala Met Gly Lys Ser Pro Leu Asn Gin Gly Asp Ser Ser 
115 120 125 

Thr Pro Asn Leu Pro Lys Gin Ala Ala Ser Thr Ser Gly Pro Thr Pro 
130 135 140 

Pro Ala Ser Gin Ala Leu Asn Pro Gin Ala Gin Lys Gin Val Gly Leu 
145 150 155 160 

Val Thr Ser Ser Pro Ala Thr Ser Gin Thr Gly Pro Gly lie Cys Met 
165 170 175 

Asn Ala Asn Phe Asn Gin Thr His Pro Gly Leu Leu Asn Ser Asn Ser 
180 185 190 

Gly His Ser Leu Met Asn Gin Ala Gin Gin Gly Gin Ala Gin Val Met 
195 200 205 

Asn Gly Ser Leu Gly Ala Ala Gly Arg Gly Arg Gly Ala Gly Met Pro 
210 215 220 

Tyr Pro Ala Pro Ala Met Gin Gly Ala Thr Ser Ser Val Leu Ala Glu 
225 230 235 240 

Thr Leu Thr Gin Val Ser Pro Gin Met Ala Gly His Ala Gly Leu Asn 
245 250 255 

Thr Ala Gin Ala Gly Gly Met Thr Lys Met Gly Met Thr Gly Thr Thr 
260 265 270 

Ser Pro Phe Gly Gin Pro Phe Ser Gin Thr Gly Gly Gin Gin Met Gly 
275 280 285 

Ala Thr Gly Val Asn Pro Gin Leu Ala Ser Lys Gin Ser Met Val Asn 
290 295 300 
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Ser Leu Pro Ala Phe Pro Thr Asp lie Lys Asn Thr Ser Val Thr Thr 
305 310 315 320 

Val Pro Asn Met Ser Gin Leu Gin Thr Ser Val Gly lie Val Pro Thr 
325 330 335 

Gin Ala lie Ala Thr Gly Pro Thr Ala Asp Pro Glu Lys Arg Lys Leu 
340 345 350 

lie Gin Gin Gin Leu Val Leu Leu Leu His Ala His Lys Cys Gin Arg 
355 360 365 

Arg Glu Gin Ala Asn Gly Glu Val Arg Ala Cys Ser Leu Pro His Cys 
370 375 380 

Arg Thr Met Lys Asn Val Leu Asn His Met Thr His Cys Gin Ala Pro 
385 390 395 400 

Lys Ala Cys Gin Val Ala His Cys Ala Ser Ser Arg Gin lie lie Ser 
405 410 415 

His Trp Lys Asn Cys Thr Arg His Asp Cys Pro Val Cys Leu Pro Leu 
420 425 430 

Lys Asn Ala Ser Asp Lys Arg Asn Gin Gin Thr lie Leu Gly Ser Pro 
435 440 445 

Ala Ser Gly lie Gin Asn Thr He Gly Ser Val Gly Ala Gly Gin Gin 
450 455 460 

Asn Ala Thr Ser Leu Ser Asn Pro Asn Pro He Asp Pro Ser Ser Met 
465 470 475 480 

Gin Arg Ala Tyr Ala Ala Leu Gly Leu Pro Tyr Met Asn Gin Pro Gin 
485 490 495 

Thr Gin Leu Gin Pro Gin Val Pro Gly Gin Gin Pro Ala Gin Pro Pro 

500 505 510 

Ala His Gin Gin Met Arg Thr Leu Asn Ala Leu Gly Asn Asn Pro Met 
515 520 525 

Ser Val Pro Ala Gly Gly lie Thr Thr Asp Gin Gin Pro Pro Asn Leu 
530 535 540 

He Ser Glu Ser Ala Leu Pro Thr Ser Leu Gly Ala Thr Asn Pro Leu 
545 550 555 560 

Met Asn Asp Gly Ser Asn Ser Gly Asn He Gly Ser Leu Ser Thr He 

565 570 575 

Pro Thr Ala Ala Pro Pro Ser Ser Thr Gly Val Arg Lys Gly Trp His 
580 585 590 

Glu His Val Thr Gin Asp Leu Arg Ser His Leu Val His Lys Leu Val 
595 600 605 

Gin Ala He Phe Pro Thr Pro Asp Pro Ala Ala Leu Lys Asp Arg Arg 
610 615 620 

Met Glu Asn Leu Val Ala Tyr Ala Lys Lys Val Glu Gly Asp Met Tyr 
625 630 635 640 

Glu Ser Ala Asn Ser Arg Asp Glu Tyr Tyr His Leu Leu Ala Glu Lys 
645 650 655 
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lie Tyr Lys lie Gin Lys Glu Leu Glu Glu Lys Arg Arg Thr Arg Leu 
660 665 670 

His Lys Gin Gly lie Leu Gly Asn Gin Pro Ala Leu Pro Ala Ser Gly 
675 680 685 

Ala Gin Pro Pro Val lie Pro Pro Ala Gin Ser Val Arg Pro Pro Asn 
690 695 700 

Gly Pro Leu Pro Leu Pro Val Asn Arg Met Gin Val Ser Gin Gly Met 
705 710 715 720 

Asn Ser Phe Asn Pro Met Ser Leu Gly Asn Val Gin Leu Pro Gin Ala 
725 730 735 

Pro Met Gly Pro Arg Ala Ala Ser Pro Met Asn His Ser Val Gin Met 
740 745 750 

Asn Ser Met Ala Ser Val Pro Gly Met Ala lie Ser Pro Ser Arg Met 
755 760 765 

Pro Gin Pro Pro Asn Met Met Gly Thr His Ala Asn Asn lie Met Ala 
770 775 780 

Gin Ala Pro Thr Gin Asn Gin Phe Leu Pro Gin Asn Gin Phe Pro Ser 
785 790 795 800 

Ser Ser Gly Ala Met Ser Val Asn Ser Val Gly Met Gly Gin Pro Ala 
805 810 815 

Ala Gin Ala Gly Val Ser Gin Gly Gin Glu Pro Gly Ala Ala Leu Pro 
820 825 830 

Asn Pro Leu Asn Met Leu Ala Pro Gin Ala Ser Gin Leu Pro Cys Pro 
835 840 845 

Pro Val Thr Gin Ser Pro Leu His Pro Thr Pro Pro Pro Ala Ser Thr 
850 855 860 

Ala Ala Gly Met Pro Ser Leu Gin His Pro Thr Ala Pro Gly Met Thr 
865 870 875 880 

Pro Pro Gin Pro Ala Ala Pro Thr Gin Pro Ser Thr Pro Val Ser Ser 
885 890 895 

Gly Gin Thr Pro Thr Pro Thr Pro Gly Ser Val Pro Ser Ala Ala Gin 
900 905 910 

Thr Gin Ser Thr Pro Thr Val Gin Ala Ala Ala Gin Ala Gin Val Thr 
915 920 925 

Pro Gin Pro Gin Thr Pro Val Gin Pro Pro Ser Val Ala Thr Pro Gin 
930 935 940 

Ser Ser Gin Gin Gin Pro Thr Pro Val His Thr Gin Pro Pro Gly Thr 
945 950 955 . 960 

Pro Leu Ser Gin Ala Ala Ala Ser lie Asp Asn Arg Val Pro Thr Pro 
965 970 975 

Ser Thr Val Thr Ser Ala Glu Thr Ser Ser Gin Gin Pro Gly Pro Asp 
980 985 990 

Val Pro Met Leu Glu Met Lys Thr Glu Val Gin Thr Asp Asp Ala Glu 
995 1000 1005 
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Pro Glu Pro Thr Glu Ser Lys Gly Glu Pro Arg Ser Glu Met Met Glu 
1010 1015 1020 

Glu Asp Leu Gin Gly Ser Ser Gin Val Lys Glu Glu Thr Asp Thr Thr 
1025 1030 1035 1040 

Glu Gin Lys Ser Glu Pro Met Glu Val Glu Glu Lys Lys Pro Glu Val 
1045 1050 1055 

Lys Val Glu Ala Lys Glu Glu Glu Glu Asn Ser Ser Asn Asp Thr Ala 
1060 1065 1070 

Ser Gin Ser Thr Ser Pro Ser Gin Pro Arg Lys Lys lie Phe Lys Pro 
1075 1080 1085 

Glu Glu Leu Arg Gin Ala Leu Met Pro Thr Leu Glu Ala Leu Tyr Arg 
1090 1095 1100 

Gin Asp Pro Glu Ser Leu Pro Phe Arg Gin Pro Val Asp Pro Gin Leu 
1105 1110- 1115 1120 

Leu Gly lie Pro Asp Tyr Phe Asp lie Val Lys Asn Pro Met Asp Leu 
1125 1130 1135 

Ser Thr lie Lys Arg Lys Leu Asp Thr Gly Gin Tyr Gin Glu Pro Trp 
1140 1145 1150 

Gin Tyr Val Asp Asp Val Arg Leu Met Phe Asn Asn Ala Trp Leu Tyr 
1155 1160 1165 

Asn Arg Lys Thr Ser Arg Val Tyr Lys Phe Cys Ser Lys Leu Ala Glu 
1170 1175 1180 

Val Phe Glu Gin Glu lie Asp Pro Val Met Gin Ser Leu Gly Tyr Cys 
1185 1190 1195 1200 

Cys Gly Arg Lys Tyr Glu Phe Ser Pro Gin Thr Leu Cys Cys Tyr Gly 
1205 1210 1215 

Lys Gin Leu Cys Thr lie Pro Arg Asp Ala Ala Tyr Tyr Ser Tyr Gin 
1220 1225 1230 

Asn Arg Tyr His Phe Cys Gly Lys Cys Phe Thr Glu lie Gin Gly Glu 
1235 1240 1245 

Asn Val Thr Leu Gly Asp Asp Pro Ser Gin Pro Gin Thr Thr lie Ser 
1250 1255 1260 

Lys Asp Gin Phe Glu Lys Lys Lys Asn Asp Thr Leu Asp Pro Glu Pro 
1265 1270 1275 1280 

Phe Val Asp Cys Lys Glu Cys Gly Arg Lys Met His Gin lie Cys Val 
1285 1290 1295 

Leu His Tyr Asp lie lie Trp Pro Ser Gly Phe Val Cys Asp Asn Cys 
1300 1305 1310 

Leu Lys Lys Thr Gly Arg Pro Arg Lys Glu Asn Lys Phe Ser Ala Lys 
1315 1320 1325 

Arg Leu Gin Thr Thr Arg Leu Gly Asn His Leu Glu Asp Arg Val Asn 
1330 1335 1340 

Lys Phe Leu Arg Arg Gin Asn His Pro Glu Ala Gly Glu Val Phe Val 
1345 1350 1355 1360 
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Arg Val Val Ala Ser Ser Asp Lys Thr Val Glu Val Lys Pro Gly Met 
1365 1370 1375 

Lys Ser Arg Phe Val Asp Ser Gly Glu Met Ser Glu Ser Phe Pro Tyr 
1380 1385 1390 

Arg Thr Lys Ala Leu Phe Ala Phe Glu Glu lie Asp Gly Val Asp Val 
1395 1400 1405 

Cys Phe Phe Gly Met His Val Gin Asp Thr Ala Leu lie Ala Pro His 
1410 1415 1420 

Gin lie Gin Gly Cys Val Tyr lie Ser Tyr Leu Asp Ser lie His Phe 
1425 1430 1435 1440 

Phe Arg Pro Arg Cys Leu Arg Thr Ala Val Tyr His Glu lie Leu lie 
1445 1450 1455 

Gly Tyr Leu Glu Tyr Val Lys Lys Leu Val Tyr Val Thr Ala His lie 
1460 1465 1470 

Trp Ala Cys Pro Pro Ser Glu Gly Asp Asp Tyr lie Phe His Cys His 
1475 1480 1485 

Pro Pro Asp Gin Lys lie Pro Lys Pro Lys Arg Leu Gin Glu Trp Tyr 
1490 1495 1500 

Lys Lys Met Leu Asp Lys Ala Phe Ala Glu Arg lie lie Asn Asp Tyr 
1505 1510 1515 1520 

Lys Asp lie Phe Lys Gin Ala Asn Glu Asp Arg Leu Thr Ser Ala Lys 
1525 1530 1535 

Glu Leu Pro Tyr Phe Glu Gly Asp Phe Trp Pro Asn Val Leu Glu Glu 
1540 1545 1550 

Ser lie Lys Glu Leu Glu Gin Glu Glu Glu Glu Arg Lys Lys Glu Glu 
1555 1560 1565 

Ser Thr Ala Ala Ser Glu Thr Pro Glu Gly Ser Gin Gly Asp Ser Lys 
1570 1575 1580 

Asn Ala Lys Lys Lys Asn Asn Lys Lys Thr Asn Lys Asn Lys Ser Ser 
1585 1590 1595 1600 

lie Ser Arg Ala Asn Lys Lys Lys Pro Ser Met Pro Asn Val Ser Asn 
1605 1610 1615 

Asp Leu Ser Gin Lys Leu Tyr Ala Thr Met Glu Lys His Lys Glu Val 
1620 1625 1630 

Phe Phe Val lie His Leu His Ala Gly Pro Val lie Ser Thr Gin Pro 
1635 1640 1645 

Pro lie Val Asp Pro Asp Pro Leu Leu Ser Cys Asp Leu Met Asp Gly 
1650 1655 1660 

Arg Asp Ala Phe Leu Thr Leu Ala Arg Asp Lys His Trp Glu Phe Ser 
1665 1670 1675 1680 

Ser Leu Arg Arg Ser Lys Trp Ser Thr Leu Cys Met Leu Val Glu Leu 
1685 1690 1695 

His Thr Gin Gly Gin Asp Arg Phe Val Tyr Thr Cys Asn Glu Cys Lys 
1700 1705 1710 
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His His Val Glu Thr Arg Trp His Cys Thr Val Cys Glu Asp Tyr Asp 
1715 1720 1725 

Leu Cys lie Asn Cys Tyr Asn Thr Lys Ser His Thr His Lys Met Val 
1730 1735 1740 

Lys Trp Gly Leu Gly Leu Asp Asp Glu Gly Ser Ser Gin Gly Glu Pro 
1745 1750 1755 1760 

Gin Ser Lys Ser Pro Gin Glu Ser Arg Arg Leu Ser lie Gin Arg Cys 
1765 1770 1775 

lie Gin Ser Leu Val His Ala Cys Gin Cys Arg Asn Ala Asn Cys Ser 
1780 1785 1790 

Leu Pro Ser Cys Gin Lys Met Lys Arg Val Val Gin His Thr Lys Gly 
1795 1800 1805 

Cys Lys Arg Lys Thr Asn Gly Gly Cys Pro Val Cys Lys Gin Leu lie 
1810 1815 1820 

Ala Leu Cys Cys Tyr His Ala Lys His Cys Gin Glu Asn Lys Cys Pro 
1825 1830 1835 1840 

Val Pro Phe Cys Leu Asn lie Lys His Asn Val Arg Gin Gin Gin lie 
1845 1850 1855 

Gin His Cys Leu Gin Gin Ala Gin Leu Met Arg Arg Arg Met Ala Thr 
1860 1865 1870 

Met Asn Thr Arg Asn Val Pro Gin Gin Ser Leu Pro Ser Pro Thr Ser 
1875 1880 1885 

Ala Pro Pro Gly Thr Pro Thr Gin Gin Pro Ser Thr Pro Gin Thr Pro 
1890 1895 1900 

Gin Pro Pro Ala Gin Pro Gin Pro Ser Pro Val Asn Met Ser Pro Ala 
1905 1910 1915 1920 

Gly Phe Pro Asn Val Ala Arg Thr Gin Pro Pro Thr lie Val Ser Ala 
1925 1930 1935 

Gly Lys Pro Thr Asn Gin Val Pro Ala Pro Pro Pro Pro Ala Gin Pro 
1940 1945 1950 

Pro Pro Ala Ala Val Glu Ala Ala Arg Gin lie Glu Arg Glu Ala Gin 
1955 1960 1965 

Gin Gin Gin His Leu Tyr Arg Ala Asn lie Asn Asn Gly Met Pro Pro 
1970 1975 1980 

Gly Arg Asp Gly Met Gly Thr Pro Gly Ser Gin Met Thr Pro Val Gly 
1985 1990 1995 2000 

Leu Asn Val Pro Arg Pro Asn Gin Val Ser Gly Pro Val Met Ser Ser 
2005 2010 2015 

Met Pro Pro Gly Gin Trp Gin Gin Ala Pro lie Pro Gin Gin Gin Pro 
2020 2025 . 2030 

Met Pro Gly Met Pro Arg Pro Val Met Ser Met Gin Ala Gin Ala Ala 
2035 2040 2045 

Val Ala Gly Pro Arg Met Pro Asn Val Gin Pro Asn Arg Ser lie Ser 
2050 2055 2060 
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Pro Ser Ala Leu Gin Asp Leu Leu Arg Thr Leu Lys Ser Pro Ser Ser 
2065 2070 2075 2080 

Pro Gin Gin Gin Gin Gin Val Leu Asn lie Leu Lys Ser Asn Pro Gin 
2085 2090 2095 

Leu Met Ala Ala Phe lie Lys Gin Arg Thr Ala Lys Tyr Val Ala Asn 
2100 2105 2110 

Gin Pro Gly Met Gin Pro Gin Pro Gly Leu Gin Ser Gin Pro Gly Met 
2115 2120 2125 

Gin Pro Gin Pro Gly Met His Gin Gin Pro Ser Leu Gin Asn Leu Asn 
2130 2135 2140 

Ala Met Gin Ala Gly Val Pro Arg Pro Gly Val Pro Pro Pro Gin Pro 
2145 2150 2155 2160 

Ala Met Gly Gly Leu Asn Pro Gin Gly Gin Ala Leu Asn lie Met Asn 
2165 2170 2175 

Pro Gly His Asn Pro Asn Met Thr Asn Met Asn Pro Gin Tyr Arg Glu 
2180 2185 2190 

Met Val Arg Arg Gin Leu Leu Gin His Gin Gin Gin Gin Gin Gin Gin 
2195 2200 2205 

Gin Gin Gin Gin Gin Gin Gin Gin Asn Ser Ala Ser Leu Ala Gly Gly 
2210 2215 2220 

Met Ala Gly His Ser Gin Phe Gin Gin Pro Gin Gly Pro Gly Gly Tyr 
2225 2230 2235 2240 

Ala Pro Ala Met Gin Gin Gin Arg Met Gin Gin His Leu Pro lie Gin 
2245 2250 2255 

Gly Ser Ser Met Gly Gin Met Ala Ala Pro Met Gly Gin Leu Gly Gin 
2260 2265 2270 

Met Gly Gin Pro Gly Leu Gly Ala Asp Ser Thr Pro Asn lie Gin Gin 
2275 2280 2285 

Ala Leu Gin Gin Arg lie Leu Gin Gin Gin Gin Met Lys Gin Gin lie 
2290 2295 2300 

Gly Ser Pro Gly Gin Pro Asn Pro Met Ser Pro Gin Gin His Met Leu 
2305 2310 2315 2320 

Ser Gly Gin Pro Gin Ala Ser His Leu Pro Gly Gin Gin lie Ala Thr 
2325 2330 2335 

Ser Leu Ser Asn Gin Val Arg Ser Pro Ala Pro Val Gin Ser Pro Arg 
2340 2345 2350 

Pro Gin Ser Gin Pro Pro His Ser Ser Pro Ser Pro Arg lie Gin Pro 
2355 2360 2365 

Gin Pro Ser Pro His His Val Ser Pro Gin Thr Gly Thr Pro His Pro 
2370 2375 2380 

Gly Leu Ala Val Thr Met Ala Ser Ser Met Asp Gin Gly His Leu Gly 
2385 2390 2395 2400 

Asn Pro Glu Gin Ser Ala Met Leu Pro Gin Leu Asn Thr Pro Asn Arg 
2405 2410 2415 
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Ser Ala Leu Ser Ser Glu Leu Ser Leu Val Gly Asp Thr Thr Gly Asp 
2420 2425 2430 

Thr Leu Glu Lys Phe Val Glu Gly Leu 
2435 2440 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Val Glu Gly Asp Met Tyr Glu Ser Ala Asn Ser Arg Asp Glu 
5 10 15 
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That which is claimed is: 

1. A method for the identification of a compound 
which inhibits activation of cAMP and mitogen responsive 
genes, said method comprising: 

monitoring expression of reporter in response to 
exposure to .said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
out in the presence of : 

a signal dependent transcription 
factor, 

a polypeptide comprising at least 
amino acid residues 461-661 of 
the protein set forth in SEQ ID 
NO : 2 , and 
a reporter construct; 
wherein said reporter construct comprises a 
reporter gene under the control of a signal 
dependent transcription factor. 

2 . A method according to claim 1 wherein said 
signal dependent transcription factor is a phosphorylation 
dependent activator . 

3. A method according to claim 1 wherein said 
signal dependent transcription factor is selected from Jun, 
Fos, serum responsive factor, Elk or steroid hormone 
receptors, 

4. A method according to claim 1 wherein 
expression of said reporter gene is monitored by ELISA, 
immunoblot , immunofluorescence , or immunoprecipitation . 

5. A method according to claim 1 wherein said 
reporter gene encodes lucif erase, ft-galactosidase or 
chloramphenicol transferase. 



10 
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6. A method according to claim 5 wherein said 
reporter construct is selected from CRE-lacZ, SRE-lacZ or 
TRE-lacZ . 

7 . A method according to claim 1 wherein said 
compound is selected from antibodies raised against the 
binding domain of the protein set forth in SEQ ID NO: 2 or 
antibodies raised against the binding domain of CBP-like 

5 compounds. 

8 . A method according to claim 7 wherein said 
antibody is raised against a polypeptide fragment 
comprising amino acid residues 634-648 of the protein set 
forth in SEQ ID NO: 3. 

9 ♦ A method according to claim 7 wherein said 
antibody is raised against a polypeptide fragment 
comprising amino acid residues 461-661 of the protein set 
forth in SEQ ID NO: 2* 

10. A method for the identification of a 
compound which inhibits activation of cAMP and mitogen 
responsive genes, said method comprising: 

(1) contacting a test system with said compound 
5 under physiological conditions; and 

(2) monitoring expression of reporter in 
response to said compound, relative to 
expression of reporter in the absence of 
said compound, wherein said reporter is 

10 encoded by a reporter construct comprising 

a reporter gene under the control of a 
signal dependent transcription factor, and 
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wherein said test system comprises: 

said signal dependent transcription 
15 factor, 

a polypeptide comprising at least 
amino acid residues 461-661 of 
the protein set forth in SEQ ID 
NO : 2 , and 

2 0 said reporter construct. 

11. A method according to claim 10 wherein said 
signal dependent transcription factor is selected from Jun, 
Fos, serum responsive factor, Elk or steroid hormone 
receptors. 

12. A method according to claim 10 wherein 
expression of said reporter gene is monitored by ELISA, 
immunoblot, immunofluorescence or immunoprecipitat ion , 

13. A method according to claim 10 wherein said 
reporter gene encodes lucif erase, 5-galactosidase or 
chloramphenicol transferase . 

14 . A method according to claim 13 wherein said 
reporter construct is selected from CRE-lacZ, SRE-lacZ or 
TRE-lacZ. 

15. A method for the identification of a 
compound which promotes activation of cAMP and mitogen 
responsive genes, said method comprising: 

monitoring expression of reporter in response to 
exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
out in the presence of : 

a signal dependent transcription 
factor , 
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a polypeptide comprising at least 
amino acid residues 461-661 of 
the protein set forth in SEQ ID 
NO : 2 , and 

15 a reporter construct; 

wherein said reporter construct comprises a 
reporter gene under the control of a signal 
dependent transcription factor. 

16. A method according to claim 15 wherein 
expression of said reporter gene is monitored by ELISA, 
immunoblot , immunof luorescence , or immunoprecipitation . 

17. A method for the identification of a 
compound which has the binding and/ or activation properties 
characteristic of CREB binding protein, said method 
comprising: 

5 monitoring expression of reporter in response to 

exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
out in the presence of: 
10 a signal dependent transcription 

factor, and 
a reporter construct, 
wherein said reporter construct comprises a 
reporter gene under the control of a signal 
15 dependent transcription factor. 

18. A method according to claim 17 wherein 
expression of said reporter gene is monitored by ELISA, 
immunoblot, immunofluorescence or immunoprecipitation. 

19. A method for the identification of a 
compound which has the transcription activation properties 
characteristic of a signal dependent transcription factor, 
said method comprising: 
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5 monitoring expression of reporter in response to 

exposure to said compound, relative to expression of 
reporter in the absence of said compound, 

wherein exposure to said compound is carried 
out in the presence of: 
10 a polypeptide comprising at least 

amino acid residues 461-661 of 
the protein set forth in SEQ ID 
NO : 2 , and 
a reporter construct, 
15 wherein said reporter construct comprises a 

reporter gene under the control of a signal 
dependent transcription factor. 



20. A method according to claim 19 wherein 
expression of said reporter gene is monitored by ELISA, 
immunoblot , immunof luorescence or immunoprecipitat ion . 
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